Using Perturbed Handwriting to Support Writer ID in the Presence of Severe Data Constraints
نویسندگان
چکیده
Usually we assume large amount of data for writer identification, what if only little data is available, e.g., 1 page, 1 paragraph, or even 1 line? Several reasonable assumptions in forensics: • The writers are no longer available. • The writers might not be collaborative. The question is then: what can we do in this situation? • Use of synthetic data has been studies in fields like: signature verification, handwriting recognition. • In general, there are two methodologies involved: 1. Model Generated Handwriting (MGH)-build models to simulate people' s handwriting behavior [1,2]. 2. Model Perturbed Handwriting (MPG)-build models to manipulate people' s real handwriting, also known as deformation [3]. In this work, we only study model perturbed handwriting. Issues Techniques for handwriting recognition and writer ID share a lot in common, but: • Handwriting recognition engines strive for inter-writer commonalities, i.e., the characters/words people write. • Writer ID engines strive for inter-writer variances, i.e., the idiosyncratic styles people write in. Varga and Bunke propose a MPH model for handwriting recognition where four deformation methods are presented [1] :
منابع مشابه
Using perturbed handwriting to support writer identification in the presence of severe data constraints
Since real data is time-consuming and expensive to collect, label, and use, researchers have proposed approaches using synthetic variations for the tasks of signature verification, speaker authentication, handwriting recognition, keyword spotting, etc. However, the limitation of real data is particularly critical in the field of writer identification in that in forensics, enemies or criminals u...
متن کاملOffline Language-free Writer Identification based on Speeded-up Robust Features
This article proposes offline language-free writer identification based on speeded-up robust features (SURF), goes through training, enrollment, and identification stages. In all stages, an isotropic Box filter is first used to segment the handwritten text image into word regions (WRs). Then, the SURF descriptors (SUDs) of word region and the corresponding scales and orientations (SOs) are extr...
متن کاملText Dependent Writer Identification using Support Vector Machine
Writer identification is the process of identifying the writer of the document based on their handwriting. Recent advances in computational engineering, artificial intelligence, data mining, image processing, pattern recognition and machine learning have shown that it is possible to automate writer identification. This paper proposes a model for text-dependent writer identification based on Eng...
متن کاملText Dependent Writer Identification using Support Vector Machine
Writer identification is the process of identifying the writer of the document based on their handwriting. Recent advances in computational engineering, artificial intelligence, data mining, image processing, pattern recognition and machine learning have shown that it is possible to automate writer identification. This paper proposes a model for text-dependent writer identification based on Eng...
متن کاملPrediction of Writer Using Tamil Handwritten Document Image Based on Pooled Features
Abstract—Tamil handwritten document is taken as a key source of data to identify the writer. Tamil is a classical language which has 247 characters include compound characters, consonants, vowels and special character. Most characters of Tamil are multifaceted in nature. Handwriting is a unique feature of an individual. Writer may change their handwritings according to their frame of mind and t...
متن کامل